Fitness-dependent topological properties of the World Trade Web 
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Among the proposed network models, the hidden variable (or good get richer) one is particularly 
interesting, even if an explicit empirical test of its hypotheses has not yet been performed on a real 
network. Here we provide the first empirical test of this mechanism on the world trade web, the 
network defined by the trade relationships between world countries. We find that the power-law 
distributed gross domestic product can be successfully identified with the hidden variable (or fitness) 
determining the topology of the world trade web: all previously studied properties up to third-order 
correlation structure (degree distribution, degree correlations and hierarchy) are found to be in 
excellent agreement with the predictions of the model. The choice of the connection probability is 
such that all realizations of the network with the same degree sequence are equiprobable. 
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Networks emerge almost ubiquitously in complex sys- 
tems, from the cell to the Internet and the economy [1-3]. 
Despite differences in their nature, many real- world net- 
works are characterized by similar topological properties, 
strikingly different from those displayed by simple ran- 
dom graphs. This finding has motivated the search for 
theoretical models aimed at understanding the mecha- 
nisms at the basis of network organization. Currently, 
such models can be grouped in two main classes: a first 
class exploiting the rich get richer or preferential attach- 
ment mechanism [1] , where the topology evolves in such 
a way that already well connected vertices become more 
and more connected in a multiplicative fashion, and a sec- 
ond class based on the good get richer [4] or hidden vari- 
able [4,5] hypothesis, where vertices are assumed to be 
characterized by an intrinsic quantity or fitness which de- 
termines their connection probability. The former mech- 
anism is particularly suitable to model evolving networks 
whose future topology is likely to be determined essen- 
tially by their past one, without any additional 'external' 
information. The latter mechanism is better adapted to 
model static networks where the topological properties 
are essentialy determined by some additional, 'physical' 
information of non-topological nature but intrinsically re- 
lated to the role played by each vertex in the network. 

With suitable implementations, both types of models 
were shown to successfully reproduce a range of nontriv- 
ial topological properties [1,2,4,5]. However, the main 
question remains whether the basic hypotheses of the 
models can be tested satisfactorily in each particular case 
study, so that one can decide which theoretical mecha- 
nism best captures the empirical organizing principle of 
the specific system. While the effective presence of pref- 
erential attachment has been detected in some evolving 
networks such as the Internet, collaboration and coau- 
thorship networks [6,7], no analogous test for the hidden 
variable mechanism has been so far performed on those 
networks which are the natural candidates for it. Indeed, 
in most cases the hidden variable is regarded as a hypo- 



thetical quantity with no clear physical interpretation [5] 
or as a statistical one equal to the desired degree (or fu- 
gacity) [8] of a vertex. In the present Letter, we provide 
complete evidence that the hidden variable mechanism 
correctly reproduces all the relevant topological proper- 
ties of a real network: the World Trade Web (WTW in 
the following), or the network formed by the trade re- 
lationships between all world countries [9]. Our main 
result is that the Gross Domestic Product (GDP in the 
following) can be identified with the fitness variable that, 
once a form of the connection probability is introduced, 
completely specifies the expected topological properties 
of the WTW. Remarkably, all expected trends are in ex- 
cellent agreement with the empirically observed ones. 

The properties of the WTW corresponding to the year 
2000 were already studied in ref. [9] and reveal a complex 
organization of international trade channels. Moreover, 
the degree (number of trade partners) of a country was 
found to be correlated with its per capita GDP, even if a 
number of exceptions to this trend were found [9]. Here 
we present a finer analysis based on a different, more de- 
tailed data set [10] which reports all trade relationships 
for each country, and not only a limited number of them 
as in ref. [9]. This allows us to extend the previous anal- 
ysis to a wider region, with interesting results. Moreover, 
besides the values of per capita GDP, the present data set 
also reports the population sizes of all world countries, 
so that the corresponding total GDP (which we denote 
by Wi for each country i) can be easily obtained. Since 
we expect that the total GDP is more closely related to 
the trade activity of a country than the per capita GDP 
is, the knowledge of Wi allows us to consider its value 
as the natural candidate to be identified with the hidden 
variable associated to each country i in the WTW. In 
other words, since the WTW is defined by the exchange 
of wealth between its vertices, we identify the fitness (po- 
tential ability of being connected to other vertices [4]) of 
a vertex with its wealth, which here is measured by the 
GDP of the corresponding country. In order to have an 
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adimensional quantity we define the fitness Xi of a coun- 
try i as its relative GDP: 



Wi 



(1) 



where clearly < a;^ < 1 V«. In the following we will 
show the results of the analysis of the WTW and GDP 
for the year 1995, even if the data set covers a longer time 
interval [10] and we analysed several different years ob- 
taining analogous results. For the 1995 data, the number 
of countries equals iV = 191 and the number of links is 
L = 16, 255. Note that we are referring to the undirected 
version of the WTW (also studied in ref. [9]), where two 
countries i and j are connected if a nonzero trade flow 
exists in any direction between them (z exports to j, j ex- 
ports to i or both conditions apply). The reason for this 
choice is that, as we show below, a suitable form of the 
connection probability can be introduced which correctly 
reproduces the undirected version of the network and at 
the same time has an interesting economic meaning. 
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FIG. 1. Cumulative fitness distribution (circles) of world 
countries and power law with slope -1 (solid line). 

In the fitness model [4], once the set of fitness values 
{xi}iLi is fixed, the expected topological properties of 
the network only depend on the statistical distribution 
p{x) of the fitness and on the probability f{xi,Xj) that 
the vertices i and j are connected. The cumulative fit- 
ness distribution p>{x) = p{x')dx' for our system is 
shown in fig.l and is found to display a power-tail of the 
form py{x) oc x^~'^ , corresponding to a tail in the proba- 
bility density of the form p{x) oc x~'^ . A power-law with 
exponent 1 — t = — 1 (corresponding to r = 2) is shown 
as a reference for the large x range. This Pareto tail is 
common to many other wealth distributions [11]. 

Now we must specify a form of the connection proba- 
bility. Consistently with our expectation that the GDP 
of a country represents its potential ability to develop 
trade relationships with other countries, we interpret the 
GDP (up to a proportionality constant) as the desired 
degree of a country in the same spirit of ref. [8] . Then, 
using the same arguments of the cited reference to require 
that no two vertices have more than one edge connecting 



them, we end up with the following form of the connec- 
tion probability: 



f{xi,Xj) 



1 -|- SxiXj 



(2) 



where (5 > is the only free parameter of the model. 
The idea leading to the above choice was first proposed 
in ref. [12] to detect nontrivial patterns of a real network 
by comparing it with a suitably randomized one with 
the same degree sequence. This procedure also allows to 
check if some properties of the network are merely due to 
the impossibility of having multiple links [8,12]. Once the 
dependence on x is made explicit, the requirement of ref. 
[12] translates to eq. (2). With such a choice, once the 
set of fitness values {xi]f^i is specified, all realizations 
of the network with the same degree sequence {ki}f^i 
occur in the ensemble of all possible ones with the same 
probability [8,12]. In other words, all the networks ob- 
tained through permutations of the links that leave the 
degree sequence unchanged are equiprobable. From an 
economic point of view, this means that once the GDP 
of world countries is specified, the probability of having 
a particular realization of the WTW only depends on the 
degree sequence, which here is the vector of the numbers 
of trade partners. 

The form of the connection probability f{xi,Xj) com- 
pletely specifies the topological properties of the network. 
For instance, one zeroth-order property is the expected 
number of links, which is given by: 



Xj) 



(3) 
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and determines the expected mean degree (k) — 2L/N of 
the network (throughout the Letter, the expected value 
of a quantity will be denoted by a tilde). This can be 
used in order to tune the parameter 5 to a value such 
that L equals the actual number L of links in the net- 
work. We find that this is obtained when i5 = 807V^, 
and all the following results of the present Letter corre- 
spond to this single choice of the parameter. Note that 
in eq.(3) and in what follows we avoid the use of inte- 
grals since the discrete sums can be directly computed 
on the empirical values of Xi without introducing an ap- 
proximated analyitical form for p{x) which would result 
in loss of information. In this way we completely focus 
on the process yielding the expected quantities once the 
values {xi}fL^ are given as an input. 

A first- order property that can be used to test the pre- 
dictions of the model is the dependence of the expected 
degree ki of each vertex i 



ki ^ ^ fi'^i: '• 



(4) 



on the fitness Xi. In fig. 2 we compare the predicted 
behaviour of ki(xi) (solid line) with the empirical one 
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FIG. 2. Comparison of the observed and expected depen- 
dence of the degree k on the fitness x (the circles are obtained 
through a logarithmic binning of raw data). The asymptotic 
trends fc = — 1 and k ~ 5x are also shown. 

of ki{xi) (circles) and find a close agreement between 
them. As expected, richer countries have more connec- 
tions than poor ones. More precisely, ki is an increasing 
function of Xi such that ki ^ N — 1 when Xi oo. 
This corresponds to the requirement that no two edges 
are connected by more than one link, and closely reflects 
the results of ref. [8]. In the opposite limit Xi fa one 
has f{xi, Xj) « SxiXj and therefore ki « Sxi Xj w Sxi 
(both asymptotic trends are shown as a dashed and a 
dotted line respectively) . Let us stress that the predicted 
values ki are obtained only through the empirical GDP 
data without any additional information about the topol- 
ogy of the network. By contrast, the actual values ki are 
obtained only from the WTW data, which is an indepen- 
dent source of information. This remarkable comment 
extends to every result that will be presented below. 

Once the expected values ki are given, one can plot the 
corresponding statistical distribution. In fig. 3 we com- 
pare the empirical cumulative degree distribution (cir- 
cles) with the predicted one (solid line), and we find 
that they are in excellent agreement. As a comparison, 
note that the empirical degree distribution presented in 
ref. [9] extends to a narrower region and diplays a less 
pronounced cut-off for large k. This is probably due 
to the fact that in that case the data set reports only 
the 40 most relevant links for each country, even if by 
a symmetry argument the authors recover many (but 
not all) missing observations. This leads the authors 
to fit the data with a power law P>{k) oc k^^'' with 
1— 7 = — 1.6±0.1 (shown as a reference in fig. 3). Here 
instead we find that the power-law region is only a small 
part of the whole degree distribution and we conclude 
that the WTW is not a scale-free network. The sharp 
cut-off for large k corresponds to the saturation effect 
k{x) N — 1 ioY large x shown in fig. 2, and it is in 
accordance with the theoretical results of ref. [8] . 

In ref. [9] it was shown that the WTW is also char- 
acterized by higher-order properties which require the 
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FIG. 3. Cumulative degree distributions P>(fc): empirical 
(circles) and predicted (solid line). A power law with slope 
— 1.6 is also shown (dashed line). 

knowledge of correlations between vertex degrees. For 
instance, the average nearest neighbour degree K™{k) 
(defined as the mean degree K™^ of the neighbours of 
a vertex i, averaged over all vertices with the same de- 
gree ki) is a second-order topological property requiring 
the knowledge of the two-vertices conditional probabil- 
ity P{k'\k) that a vertex with degree k is connected to 
a vertex with degree k' . The value of JiT"" predicted by 
the fitness model [4,5] is 

j^nn ^ ^j^i ^k^jf ,Xj)f{Xj,Xk) 
ki 

In fig. 4 we plot the values of K™ versus ki for the empir- 
ical network and compare them with the predicted ones 
K™^. The accordance is again very good, apart from a 
slight underestimation of K™ for small k. The decreas- 
ing trend of K""-{k) signals the presence of disassorta- 
tivity, or anticorrelation between vertex degrees [13,14]. 
Poorly connected countries are on average connected to 
well connected (richer) ones. By contrast, in an uncor- 
related network K"^ is constant. Remarkably, while in 
ref. [8] it was shown that the mechanism suggested in ref. 
[12] and based on the connection probability of eq.(2) ex- 
plained only some of the observed disassortativity of the 
Internet, here the model succeeds in providing a com- 
plete description of the observed correlation structure. 
It should be stressed that, despite a power-law fit of the 
form K™{k) oc k~'^ is commonly proposed [9,14], exten- 
sive numerical simulations [8] showed that the functional 
form of K™{k) generated by the mechanism discussed 
here is different, even if in the large k range it appears to 
approach a power-law behaviour. Consistently with such 
results, the WTW of ref. [9] only displayed a power-law 
behaviour in a very narrow range. For this reason we 
avoid to fit our data with a power-law, and we prefer to 
plot them in linear scale as in fig. 4. 
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FIG. 4. Average nearest neighbour degree K^" and clus- 
tering coefficient C versus the degree k for the real (circles) 
and model (solid line) webs. 



deviations from this average behaviour, such as the de- 
tection of geographical preferences due to reduced trans- 
portation costs and the identification of countries with 
more (or less) trade partners than the mean expected 
value. We believe that the study of such deviations is 
an important point to be addressed in the future. In re- 
gards to the more general problem of network modelling, 
our results strongly support the hypothesis of the hidden 
variable or fitness model [4], which is likely to capture 
the topological organization of many other real networks. 
The possibility of identifying the fitness variable with a 
physical quantity is a significant step forward, since in 
previous studies its distribution was chosen quite arbi- 
trarily in order to yield the observed degree distribution 
of the network in an ad hoc fashion. Instead, once the fit- 
ness distribution is fixed by observation as in the present 
case, the topological properties only depend on the func- 
tional form of the connection probability and the choice 
of the latter allows a deeper physical understanding of 
the network formation process. 



Finally, we focus on the clustering coefficient Ci, which 
is a third- order topological property defined as the con- 
nected fraction of neighbour pairs of a vertex i. In 
principle, the evaluation of this quantity requires the 
knowledge of the three-vertices conditional probability 
P{k', k"\k) that a vertex with degree k is simultaneously 
connected to two vertices with degrees k' and k" [5]. 
However, the fitness model is completely specified by the 
pairwise connection probability f{xi,Xj), and as a re- 
sult the quantity P(fc', k"\k) can be simply expressed as 
P{k',k"\k) = P{k'\k)P{k"\k) [5]. The expected cluster- 
ing coefficient Ci is given by 

^ _ Y.j^iY.k^J,^fix^^Xj)f{Xj,Xk)f{Xk,Xi) 
[l^i 1] 

Figure 4 shows the dependence of the clustering coef- 
ficient on the degree (no power-law fit is provided for 
the same reason as above). The agreement between the 
predicted and the observed behaviour is again excellent. 
This strongly suggests that the real WTW is well ap- 
proximated by a network with only second-order non- 
trivial correlations, and that the third-order structure 
can be obtained in terms of the first- and second-order 
one. As noted in ref. [9], the decrease of C{k) points 
out the presence of hierarchy in the network [15], or the 
tendency of low-degree countries to trade with tightly in- 
teracting partners, and that of higher-degree countries to 
trade with more loosely interacting partners. 

We stress again the very intriguing result that all the 
predicted properties of the WTW shown in this Letter are 
obtained by exploiting only the set of GDP values, and re- 
produce almost perfectly the actual properties computed 
on the independent WTW data. By providing a good 
average description of the global trade activity, our anal- 
ysis also provides a basis for the characterization of the 
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